Generalized and transferable patient language representation for phenotyping with limited data

نویسندگان

چکیده

The paradigm of representation learning through transfer has the potential to greatly enhance clinical natural language processing. In this work, we propose a multi-task pre-training and fine-tuning approach for generalized transferable patient representations from medical language. model is first pre-trained with different but related high-prevalence phenotypes further fine-tuned on downstream target tasks. Our main contribution focuses impact technique can have low-prevalence phenotypes, challenging task due dearth data. We validate pre-training, fine-tune models including 38 circulatory diseases, 23 respiratory 17 genitourinary diseases. find increases efficiency achieves consistently high performance across majority phenotypes. Most important, almost always either best-performing or performs tolerably close model, property refer as robust. All these results lead us conclude that architecture robust developing numerous

برای دانلود باید عضویت طلایی داشته باشید

برای دانلود متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

منابع مشابه

Language Modeling with Limited Domain Data

Generic recognition systems contain language models which are representative of a broad corpus. In actual practice, however, recognition is usually on a coherent text covering a single topic, suggesting that knowledge of the topic at hand can be used to advantage. A base model can be augmented with information from a small sample of domain-specific language data to significantly improve recogni...

متن کامل

Language Modeling for limited-data domains

With the increasing focus of speech recognition and natural language processing applications on domains with limited amount of in-domain training data, enhanced system performance often relies on approaches involving model adaptation and combination. In such domains, language models are often constructed by interpolating component models trained from partially matched corpora. Instead of simple...

متن کامل

The Transferable Belief Model for Belief Representation

A survey of the use of belief functions to quantify the beliefs held by an agent, and in particular of their interpretation in the transferable belief model.

متن کامل

Using Generalized Language Model for Question Matching

Question and answering service is one of the popular services in the World Wide Web. The main goal of these services is to finding the best answer for user's input question as quick as possible. In order to achieve this aim, most of these use new techniques foe question matching. . We have a lot of question and answering services in Persian web, so it seems that developing a question matching m...

متن کامل

A Model for Project Selecting with Limited Resources in Data Envelopment Analysis with Input and Output Fuzzy

In Evaluating Performance, Selecting a Subset from a Set of Solutions with Limited Resources is Essential. If There Is More Than One Input and Output, the Data Rnvelopment Analysis Optimization Models Are Evaluated and Performance Measurement Based on the Weighted Output Is Divided Weighted Input. In This Research, Two Models of Optimization with Limited Resources Present from Data Envelopment ...

متن کامل

ذخیره در منابع من


  با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید

ژورنال

عنوان ژورنال: Journal of Biomedical Informatics

سال: 2021

ISSN: ['1532-0480', '1532-0464']

DOI: https://doi.org/10.1016/j.jbi.2021.103726